ABSTRACT


The demand for effective use of water resources in the agriculture science sector has increased due to ongoing
global climate transformations. Cost-effective and timely distribution of the appropriate amount of water is vital not only
to maintain the healthy status of plant leaves but also to drive crop productivity and achieve economic benefits.
This paper presents a novel, non-invasive machine learning (ML) driven approach using terahertz waves with a
Swissto12 material characterization kit (MCK) in the frequency range of 0.75 to 1.1 THz in real-life digital agriculture
interventions, aiming to develop a feasible and viable technique for precise estimation of water content (WC) in plant
leaves on different days. For this purpose, multi-domain features are extracted from the frequency, time, and
time-frequency domains of the observation data and fed to three machine learning algorithms: support vector
machine (SVM), K-nearest neighbour (KNN), and decision tree (D-Tree). The results demonstrate that SVM outperformed
the other classifiers under 10-fold and leave-one-observation-out cross-validation for classification of the different days,
with an overall accuracy of 98.8%, 97.15%, and 96.82% for coffee, pea-shoot, and spinach leaves respectively. In addition,
using the SFS technique, coffee showed an improvement of 15%, 11.9%, and 6.5% in computational time for SVM,
KNN, and D-Tree. For pea-shoot, improvements of 21.28%, 10.01%, and 8.53% were noticed in operating time for the SVM,
KNN, and D-Tree classifiers. Lastly, for baby-spinach leaf, improvements of 21.28%, 10.01%, and 8.53% were
noticed in operating time for the SVM, KNN, and D-Tree classifiers, which eventually enhanced the classification
accuracy. Thus, the proposed method incorporating ML using terahertz waves can be beneficial for precise estimation
of WC in leaves and can provide useful recommendations and insights for farmers to take proactive actions in
relation to plant health monitoring.


Index terms: THz sensing, machine learning, terahertz, plant health 
Introduction and Motivation


* By 2050, global food production will need to increase by an estimated 70% in developed
countries and 100% in developing countries to match current trends in population growth. In
most countries, and especially in developing countries, agriculture is considered the backbone
of overall development because of its significant role in economic growth.

* Agriculture contributes approximately 20% of total greenhouse gas emissions in Scotland and
therefore makes a significant contribution to total UK greenhouse gas emissions.
Uncontrolled application of pesticides, fertilisers and other resources is the major
contributing factor.

* Standard sensors and systems have been employed to meet the heavy demands on crop
productivity and appropriate fertilizer usage, and to detect small amounts of impurities
in soil, pathogens in plants, and nutrient deficiencies in plants. They have not delivered
satisfactory results in the agriculture sector and appear to be unfeasible.

* This motivates our work to develop a novel approach employing nano-sensors
and machine learning at the molecular level in plants. The aims are to improve reliability and
sensitivity in detecting bacteria or fungus in plants, with precise quantification at an early
stage, which would help to reduce pesticide usage, make fertilizer use more effective,
and yield a fast, portable, and cost-effective solution.
Sensing Technologies for Precision Agriculture


Sensing technologies are applied at different stages to identify contaminated crops.

[Figure: proximal sensing and remote sensing across scales (cell, leaf, plant, small area, agricultural land), from early disease detection signs using sensors to detection of contaminated plants and crops.]

* Overview of current sensor technologies used for the automated
detection and identification of host-plant interactions. These sensors can be
implemented in precision agriculture applications and plant phenotyping on
different scales, from single cells to entire ecosystems. Depending on the
scale, different platforms can be operated and consequently different
plant parameters can be observed.

* Research and experimental area: using wearable sensors on
leaves and plants for disease detection at an early stage.
Nanosensor Device (Nano-Antenna)


Nanotechnology enables the development of nano-scale devices.
Nanosensors are the most basic nano-devices that are able to perform sensing, actuation, and reporting.

The essential part of such a nanodevice is the communication unit. Graphene-based nano-antennas have been
proposed to transmit signals in the THz band (0.1-10 THz).





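[Figure: nanosensor device architecture with a nano-EM transceiver, nano-memory, nano-antenna, nanoactuator, nanosensors and nano-power unit, with dimensions on the micrometre scale.]

[Figure: the terahertz (THz) band within the electromagnetic spectrum (radio waves, microwave, infrared, ultraviolet, x-ray), lying between electronic devices and photonic devices.]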
I. F. Akyildiz and J. M. Jornet, "Electromagnetic
Wireless Nanosensor Networks," Nano
Communication Networks, 2010.
Internet of Nano Things (IoNT)


* The interconnection of nanonetworks with traditional networks and the 
Internet defines a new networking paradigm, known as the Internet of 
Nano Things 


[Figure: IoNT architecture. Legend: gateway; nano router; smart device, wearable sensor/device, environmental sensor; nanomachine; nano cluster head; 3rd party service providers.]





Nano-communication


Two main paradigms emerge: 
** Molecular communication 
** Electromagnetic communication 





The latest advancements in graphene-based electronics have opened the door 
to electromagnetic communication among nano-devices in the THz band. 






[Figure: the THz gap between the electronic approach (electronics) and the photonic approach (photonics). The THz region explored spans 100 GHz to 10 THz.]
* How do nano-devices communicate with each other?
> Molecular communication
  * Diffusion-based molecular communication
  * Flow-based molecular communication
> Sonic communication
> etc.

* The THz band channel is highly frequency selective and exhibits a
unique distance-dependent bandwidth.

* Terahertz frequencies are used because they pose no ionization hazards for
biological tissues and are less susceptible to some
propagation effects (i.e., Rayleigh fading).
Applications of Nano-scale


The nano-scale IoNT sensory data enables new applications that are limited or unavailable in the IoT, e.g.,
healthcare, environmental and agricultural monitoring, military and industry.
Nano-Sensor on Leaves Using THz Sensing


[Figure: internal morphology of a water-stressed leaf using terahertz (THz) sensing at cellular level.]


Existing Terahertz Technologies for Monitoring
Health Status and Water Content of Leaves
Water content in monitoring health status of leaves


* In the past decade, climate transformations have been predicted to increase
the occurrence of water scarcity in many parts of the world.

* This growing deficiency of water has caused enormous challenges in various
fields of the plant science sector.

* Water is an important and fundamental component required for
photosynthesis, nutrient transport, and the timely growth of plant leaves.

* In this work, a novel and non-invasive technique is presented to estimate the
water content (WC) at cellular level in plant leaves using terahertz sensing.


[Figure labels: water-stressed leaf; water; moisture; cells; composites (sugar or protein), etc.; pesticides; air cavity.]

Fig. 1: Internal Morphology of Leaf Using Terahertz (THz) Sensing at Cellular Level


July, 2019 





University of Glasgow





Overall Approach of Proposed Technique


[Flow diagram, labels as extracted: Experimental Samples; Observe Physical Parameters; Setup of System (waveguide); Performing ...; Results.]














* Network analyzer
* Swissto12 THz system (0.75 to 1.1 THz)
* A waveguide system
* Polytetrafluoroethylene (PTFE) caps
* Calibration kit (WR-1.0)
* Transmission Electron Microscope (TEM)

Samples

* Fresh leaves (Baby leaf, Peashoot, Spinach)
* Digital scale and Vernier calliper
Results


Statistical Analysis of Leaves


Fig. 3: Demonstration of measuring the weight and thickness of
leaves using a digital scale and a Vernier calliper respectively.


[Plot: water content versus Time (day); legend entries include Baby Leaf and Spinach Leaf.]
Fig.: Water content of the leaves from day 1 to day 4.


WC is expressed as a weight ratio relative to the fresh weight, W_fresh, multiplied by 100%.

* The aim was to analyse the variations of water content (WC) in all three leaves over four
consecutive days and its effect on absorption.

* Baby-leaf contained a high volumetric WC, followed by pea-shoot and
spinach leaf.

* As the days progressed, the weights of all leaves were drastically reduced due to the
evaporation of WC from the leaves.
Nicolson-Ross-Weir (NRW) method
Real and Imaginary Permittivity


[Plot: real and imaginary permittivity versus F (THz).]
Fig.: Permittivity (real and imaginary) response of all three leaves from day 1 to day 4
over the frequency range from 0.75 THz to 1.1 THz.
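
The NRW extraction named on this slide can be sketched in a few lines. This is a minimal illustration and not the authors' processing chain: it assumes TEM (free-space-like) propagation with no waveguide cutoff correction, a sample thin enough that the transmission phase does not wrap, and the e^{jwt} sign convention (so a lossy permittivity is written eps' - j eps''). The function names `slab_s_params` and `nrw_extract` are hypothetical.

```python
import cmath
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s


def slab_s_params(eps_r, mu_r, thickness_m, freq_hz):
    """Forward model (for checking only): S11, S21 of a homogeneous slab, TEM case."""
    k0 = 2 * math.pi * freq_hz / C0
    n = cmath.sqrt(eps_r * mu_r)          # complex refractive index
    z = cmath.sqrt(mu_r / eps_r)          # normalised wave impedance
    gamma = (z - 1) / (z + 1)             # interface reflection coefficient
    t = cmath.exp(-1j * k0 * n * thickness_m)  # propagation factor through the slab
    denom = 1 - gamma**2 * t**2
    return gamma * (1 - t**2) / denom, t * (1 - gamma**2) / denom


def nrw_extract(s11, s21, thickness_m, freq_hz):
    """Nicolson-Ross-Weir inversion (assumes s11 != 0 and no phase wrapping)."""
    x = (s11**2 - s21**2 + 1) / (2 * s11)
    root = cmath.sqrt(x * x - 1)
    gamma = x + root
    if abs(gamma) > 1:                    # keep the passive root, |gamma| <= 1
        gamma = x - root
    t = (s11 + s21 - gamma) / (1 - (s11 + s21) * gamma)
    lambda0 = C0 / freq_hz
    # 1/Lambda on the principal branch of the logarithm (thin-sample assumption)
    inv_lambda = 1j * cmath.log(t) / (2 * math.pi * thickness_m)
    mu_r = lambda0 * inv_lambda * (1 + gamma) / (1 - gamma)
    eps_r = lambda0**2 * inv_lambda**2 / mu_r
    return eps_r, mu_r
```

A round trip through the forward model is a convenient sanity check: generating S-parameters for a 50 um slab with an assumed permittivity of 4 - 0.5j at 0.9 THz and feeding them to `nrw_extract` should return the same permittivity. Thicker samples need explicit branch selection for the logarithm, which this sketch does not attempt.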





Permittivity Response of Leaves at Three Different
Locations


[Plots: permittivity versus Frequency (THz) from 0.75 to 1.1 THz, with curves for Location 1, Location 2 and Location 3.]


Correlation of Real Permittivity and Loss of Water
Content in Leaves Over Four Consecutive Days


[Plot legend: Babyleaf, Peashoot, Coffee, Lettuce.]
Machine Learning Applications


Prominent areas where ML has been extensively used include the healthcare
sector, food security, meteorology, medicine, and economic sciences.


Furthermore, researchers are keen to explore its possibilities, specifically in
modern agriculture systems, to enhance yield by distributing water
effectively.





Flow Chart: An Approach Followed


Steps followed:

* Data collection and pre-processing (observation data for three leaves)
* Classification of all frequency features (raw data)
* Feature extraction for all different domains
* Feature selection
* Classification model (training and testing): classifier models for SVM, KNN and D-Tree
* Cross-validation (LOOCV)
* Output: classification performance of both training
and testing data sets
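
The cross-validation step in this flow can be illustrated with a dependency-free sketch of leave-one-observation-out validation. Only a KNN classifier is shown, as a stand-in for the SVM, KNN and D-Tree models named above; the function names and the toy feature vectors are hypothetical and are not measurements from this work.

```python
import math
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training points closest to the query."""
    nearest = sorted((math.dist(x, query), y) for x, y in zip(train_x, train_y))
    votes = Counter(label for _, label in nearest[:k])
    return votes.most_common(1)[0][0]


def loocv_accuracy(features, labels, k=3):
    """Hold out each observation once, train on the rest, and score the prediction."""
    correct = 0
    for i in range(len(features)):
        train_x = features[:i] + features[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        if knn_predict(train_x, train_y, features[i], k) == labels[i]:
            correct += 1
    return correct / len(features)


# Synthetic two-feature observations, three per day (illustrative values only).
toy_features = []
toy_labels = []
for day in (1, 2, 3, 4):
    for dx, dy in ((0.0, 0.0), (0.1, -0.1), (-0.1, 0.1)):
        toy_features.append([day + dx, day + dy])
        toy_labels.append(f"day{day}")

toy_accuracy = loocv_accuracy(toy_features, toy_labels)
```

With real data, each row of `toy_features` would be replaced by the multi-domain feature vector of one THz observation, and the same loop can wrap any classifier that exposes a train-then-predict step.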
Non-invasive Sensing Setup and
Data Collections


Internal Morphology of Leaves


[Figure: internal morphology of leaves using terahertz (THz) sensing at cellular level, Day 1 and Day 2.]


Loss in moisture content (MC) and other synthesis
at molecular level affects the freshness of leaves
with the passage of days.
Classification Results


Classification accuracy for leave-one-observation-out cross-validation

[Table residue: columns are Classifiers and Test Accuracy Performance (%) for the frequency domain, time domain and time-frequency domain.]

[Flow diagram: selection of classification parameters; developed model; training data 70%; testing data 30%.]


Classification of days: flow chart and identification of TRR


* From the cumulative distribution function (CDF) of the t-test probability,
the observations in the frequency range from 0.8 THz to 1.05 THz
exhibited a significant difference between the different days, with the
probability value p near 0, based on the MC of the fruit slice.

* This range is taken as the Target Response Region (TRR) for feature extraction and classification.

[Flow chart: data collection and pre-processing: VNA frequency-domain observations S_k1(f), k = 1, 2; time-domain THz signal; wavelet de-noising; time-domain denoised THz signal.]
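
One way to picture the TRR search is a per-frequency two-sample test between observations from two days. The sketch below is an assumption-laden illustration: it thresholds the magnitude of Welch's t statistic instead of computing the p-value used in the slides (the standard library has no t-distribution CDF), the threshold of 4.0 is arbitrary, and the function names and toy numbers are hypothetical.

```python
import math
import statistics


def welch_t(a, b):
    """Welch's t statistic for two samples with possibly unequal variances."""
    spread = statistics.variance(a) / len(a) + statistics.variance(b) / len(b)
    diff = statistics.mean(a) - statistics.mean(b)
    if spread == 0:
        return 0.0 if diff == 0 else math.copysign(math.inf, diff)
    return diff / math.sqrt(spread)


def target_response_region(freqs_thz, day_a, day_b, t_threshold=4.0):
    """Return (f_min, f_max) of the frequencies where the two days differ strongly.

    day_a and day_b are lists of observations; each observation holds one
    value per frequency in freqs_thz.
    """
    selected = []
    for j, f in enumerate(freqs_thz):
        a = [obs[j] for obs in day_a]
        b = [obs[j] for obs in day_b]
        if abs(welch_t(a, b)) >= t_threshold:
            selected.append(f)
    return (min(selected), max(selected)) if selected else None


# Synthetic magnitudes: the two days differ only in the middle of the band.
toy_freqs = [0.75, 0.80, 0.90, 1.00, 1.05, 1.10]
toy_day_a = [[0.50, 0.60, 0.62, 0.61, 0.60, 0.50],
             [0.52, 0.61, 0.63, 0.60, 0.61, 0.49],
             [0.48, 0.59, 0.61, 0.62, 0.59, 0.51]]
toy_day_b = [[0.51, 0.40, 0.41, 0.42, 0.40, 0.50],
             [0.49, 0.41, 0.40, 0.41, 0.42, 0.52],
             [0.50, 0.39, 0.42, 0.40, 0.41, 0.48]]
toy_trr = target_response_region(toy_freqs, toy_day_a, toy_day_b)
```

On this toy input the selected region runs from 0.80 to 1.05 THz by construction. A p-value based version would replace the threshold test with a t-distribution tail probability from a statistics package.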
Data Collection
Establish a Target Response Region
To find optimum features
* Feature extraction of frequency domain (10 features)


* The observations obtained from the VNA were in frequency-domain form,
and can therefore be used directly to extract the frequency-domain features in the TRR range:
the variance of the Power Spectral Density (PSD) and the peak
value of the Cross Power Spectral Density (CPSD).

* In the TRR range, five frequency windows, D1,...,D5, of equal width
were taken into account.
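
The windowing described here can be sketched as follows. This is only one plausible reading of the slide: it assumes the ten features are the PSD variance and the CPSD peak in each of the five windows, approximates the PSD by |S21|^2 and the CPSD by S11 times the conjugate of S21, and works in GHz so that window edges are exact. The function name and these choices are hypothetical, not taken from the source.

```python
import statistics


def window_features(freqs_ghz, s11, s21, f_lo=800, f_hi=1050, n_windows=5):
    """Return [PSD variance per window] + [CPSD peak per window] inside the TRR."""
    psd_bins = [[] for _ in range(n_windows)]
    cpsd_bins = [[] for _ in range(n_windows)]
    for f, r, t in zip(freqs_ghz, s11, s21):
        if not (f_lo <= f <= f_hi):
            continue  # outside the target response region
        # Equal-width windows D1..D5; the upper band edge falls in the last one.
        w = min(int((f - f_lo) * n_windows // (f_hi - f_lo)), n_windows - 1)
        psd_bins[w].append(abs(t) ** 2)                # assumed PSD estimate
        cpsd_bins[w].append(abs(r * t.conjugate()))    # assumed CPSD magnitude
    psd_variance = [statistics.pvariance(b) for b in psd_bins]
    cpsd_peak = [max(b) for b in cpsd_bins]
    return psd_variance + cpsd_peak


# Flat synthetic spectra on a 10 GHz grid from 750 to 1100 GHz.
toy_grid = list(range(750, 1101, 10))
toy_s11 = [0.1 + 0j] * len(toy_grid)
toy_s21 = [0.5 + 0j] * len(toy_grid)
toy_window_features = window_features(toy_grid, toy_s11, toy_s21)
```

Every window must contain at least one frequency point, otherwise `statistics.pvariance` raises an error; a measured sweep with fine frequency resolution satisfies this easily.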





* Feature extraction of time domain (11 features)


By applying the IFFT to the transmission response of the observations.
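
The move from the VNA frequency sweep to a time-domain signal can be sketched with a plain inverse DFT. The slides do not list the eleven time-domain features, so the four computed below (peak, peak position, energy, mean magnitude) are illustrative assumptions, and the function names are hypothetical. In practice an FFT routine such as `numpy.fft.ifft` would replace the O(N^2) loop.

```python
import cmath
import math


def inverse_dft(spectrum):
    """Naive inverse discrete Fourier transform of a complex spectrum."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def time_domain_features(signal):
    """A few simple descriptors of the time-domain magnitude (illustrative set)."""
    mags = [abs(v) for v in signal]
    peak = max(mags)
    return {
        "peak": peak,
        "peak_index": mags.index(peak),
        "energy": sum(m * m for m in mags),
        "mean": sum(mags) / len(mags),
    }


# Synthetic transmission response of a pure delay of 3 samples.
N = 16
toy_spectrum = [cmath.exp(-2j * math.pi * k * 3 / N) for k in range(N)]
toy_time_features = time_domain_features(inverse_dft(toy_spectrum))
```

For the pure-delay spectrum the time-domain signal is a single impulse at sample 3, which makes the peak position an easy check that the transform is wired correctly.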
* Feature extraction of time-frequency domain (4 features)


By applying the STFT or a wavelet transform to the time-domain signal of the observations.


[Figure: spectrogram of the time-domain denoised THz signal (SP, Lh=25, Nf=100.5, lin. scale, imagesc, Threshold=5%); horizontal axis: Time [ps].]

[Flow chart: data collection and pre-processing: VNA frequency-domain observations S_k1(f), k = 1, 2; IFFT; time-domain denoised THz signal; feature extraction; classification and feature selection; SVM/KNN/Decision Tree classifier models; cross-validation (LOOCV); classification performances of training/test sets.]


Classification of days


* Features are extracted from the time, frequency, and time-frequency domains.
Classification Accuracy Results for Raw Data


Accuracy (%) | Coffee | Peashoot | Baby Spinach
Optimization and Feature Selection


Table 4. Classification results for Coffee leaf


Accuracy (%) | Time features (11) | Frequency features (10) | Time-frequency features (4)
SVM          | 92.6%              | 93.0%                   | 91.6%
             | 90.0%              | 91.8%                   | 89.4%
             | 91.2%              | 90.7%                   | 91.2%


Coffee Leaf
94.46% | 93.76% | 91.15%
90.64%
91.13% | 90.38% | 89.01%
Table 8. Classification performance for Coffee leaf by applying 10-fold
validation using proposed algorithms with selected features


[Table residue. Columns: Feature Selection Methods; Classifiers; Num. of Features; Serial No. of Features; accuracy. Recoverable entries: selected feature indices 1-19, 21-25; D-Tree with indices 1-6, 8-11, 13-..., 23, 24; Relief-F with indices 2, 4, 10, 11, 17-... and accuracies 97.1 (SVM), 95.9 and 96.8.]


Table 11: Classification performance of all classifiers by applying 10-fold
validation using proposed algorithms with selected features
Feature types and feature selection | Computation time (s)
                                    | SVM    | KNN    | D-Tree
Selected features: Relief-F         | 0.6252 | 0.4842 | 0.3582





Classification accuracy per day with the leave-one-observation-out
cross-validation method, along with the moisture content
value monitored on each day.


Classes  | Test accuracy (%): three columns as extracted | Water content
Day 1    | ...   | ...   | ...   | ...
Day 3    | 100   | 92.6  | 92.3  | 12.34
Day 4    | 100   | 100   | 100   | 0.71
Variance | 0.58  | 1.09  | 0.92  |
Conclusion





In this paper, a novel and non-invasive method using THz
frequencies is presented for the characterisation of plant leaves utilizing their
electromagnetic parameters.


Moreover, the loss of WC is monitored for four consecutive days. It is
observed that the WC in leaves shows a strong correlation with the permittivity.
The average decaying response observed in permittivity from day 1 to day 4 is
attributed to the loss of WC in leaves.


The study showed that the proposed machine learning technique using
terahertz waves paves the way for a novel, robust direction for
assessing real-time information on the water content in leaves
non-invasively. The results also demonstrated that timely
detection of water stress in leaves could help to take proactive action in
relation to plant health monitoring and precision agriculture applications,
which is of high importance for improving overall productivity.